How to track and assess genotyping errors in population genetics studies.
Abstract
Genotyping errors occur when the genotype determined after molecular analysis does not correspond to the real genotype of the individual under consideration. Virtually every genetic data set includes some erroneous genotypes, but genotyping errors remain a taboo subject in population genetics, even though they might greatly bias the final conclusions, especially for studies based on individual identification. Here, we consider four case studies representing a large variety of population genetics investigations differing in their sampling strategies (noninvasive or traditional), in the type of organism studied (plant or animal) and the molecular markers used [microsatellites or amplified fragment length polymorphisms (AFLPs)]. In these data sets, the estimated genotyping error rate ranges from 0.8% for microsatellite loci from bear tissues to 2.6% for AFLP loci from dwarf birch leaves. Main sources of errors were allelic dropouts for microsatellites and differences in peak intensities for AFLPs, but in both cases human factors were non-negligible error generators. Therefore, tracking genotyping errors and identifying their causes are necessary to clean up the data sets and validate the final results according to the precision required. In addition, we propose the outline of a protocol designed to limit and quantify genotyping errors at each step of the genotyping process. In particular, we recommend (i) several efficient precautions to prevent contaminations and technical artefacts; (ii) systematic use of blind samples and automation; (iii) experience and rigor for laboratory work and scoring; and (iv) systematic reporting of the error rate in population genetics studies.
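As an illustration of how an error rate like the ones reported above might be quantified, the following minimal sketch compares a reference genotype with a blind replicate of the same individual and reports the share of mismatching genotypes and alleles. This is an assumption about one common way to compute such rates, not necessarily the authors' exact procedure. The function name `error_rates`, the locus names, and the allele sizes are hypothetical.

```python
from collections import Counter
from typing import Dict, Tuple

Genotype = Tuple[int, int]  # two allele sizes at one diploid locus


def error_rates(
    reference: Dict[str, Genotype], replicate: Dict[str, Genotype]
) -> Tuple[float, float]:
    """Return (per-genotype, per-allele) mismatch rates over shared loci."""
    loci = sorted(set(reference) & set(replicate))
    if not loci:
        raise ValueError("no loci in common between the two genotypes")

    genotype_mismatches = 0
    allele_mismatches = 0
    for locus in loci:
        ref = Counter(reference[locus])
        rep = Counter(replicate[locus])
        # Alleles in the reference that the replicate failed to reproduce;
        # allele order within a genotype is ignored.
        missing = sum((ref - rep).values())
        if missing:
            genotype_mismatches += 1
        allele_mismatches += missing

    n = len(loci)
    return genotype_mismatches / n, allele_mismatches / (2 * n)


# Hypothetical data: locus L1 shows an allelic dropout (120 is lost).
reference = {"L1": (120, 124), "L2": (200, 200), "L3": (150, 154), "L4": (98, 102)}
replicate = {"L1": (124, 124), "L2": (200, 200), "L3": (154, 150), "L4": (98, 102)}

per_genotype, per_allele = error_rates(reference, replicate)
print(f"per-genotype: {per_genotype:.3f}, per-allele: {per_allele:.3f}")
```

In this toy comparison one of four genotypes and one of eight alleles differ. In practice the counts would be pooled over many replicated samples before a rate is reported.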
Similar resources
Haplotype frequency estimation in the presence of genotyping errors.
Several statistical methods have been proposed to estimate haplotype frequencies, either based on unrelated individuals or based on families. These estimates may yield insights on population genetics as well as associations between candidate regions and disease of interest. One limitation of the existing methods is that all these methods make the implicit assumption that there are no genotyping...
Linkage Disequilibrium-Based Quality Control for Large-Scale Genetic Studies
Quality control (QC) is a critical step in large-scale studies of genetic variation. While, on average, high-throughput single nucleotide polymorphism (SNP) genotyping assays are now very accurate, the errors that remain tend to cluster into a small percentage of "problem" SNPs, which exhibit unusually high error rates. Because most large-scale studies of genetic variation are searching for phe...
Microsatellites behaving badly: empirical evaluation of genotyping errors and subsequent impacts on population studies.
Microsatellites are useful tools for ecological studies because they can be used to discern population structure, dispersal patterns and genetic relationships among individuals. However, they can also yield inaccurate genotypes that, in turn, bias results, promote biological misinterpretations, and create repercussions for population management and conservation programs. We used empirical data ...
Genotyping error detection through tightly linked markers.
The identification of genotyping errors is an important issue in mapping complex disease genes. Although it is common practice to genotype multiple markers in a candidate region in genetic studies, the potential benefit of jointly analyzing multiple markers to detect genotyping errors has not been investigated. In this article, we discuss genotyping error detections for a set of tightly linked ...
Heterozygosis deficit of polymorphic markers linked to the β-globin gene cluster region in the Iranian population
Objective(s): Iran is considered as one of the high-prevalence areas for β-thalassemia with a rate of about 10% carrier frequency. Molecular diagnosis of the disease is performed both by direct sequencing and indirectly by the use of polymorphic markers present in the beta globin gene cluster. However, to date there is no reliable information on the application of the markers in the Iranian pop...
Journal title:
- Molecular Ecology
Volume 13, Issue 11
Pages -
Publication date: 2004